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Description 

Background of the Invention 

5 During the chemical synthesis of multifunctional compounds, it is often necessary to use protecting 
groups so that selective chemical transformations can be performed. Ideally, a protecting group should 
allow for simple and efficient protection and subsequent regeneration of a functional group on the 
deprotected compound. Protecting groups which may irreversibly alter the functional group should be 
avoided. Moreover, the products should also be easily purified from side products generated during the 
10 synthesis or during the cleavage of the protecting group. 

Triphenylchloromethane (also known as triphenylmethyl chloride and trityl chloride) and related deriva- 
tives have long been used for the protection of hydroxy! groups, amino groups and thiol groups. See 
Greene, T.W., Protecting Groups in Organic Synthesis , John Wiley and Sons. NY (1981). The triphenyl- 
methyl cation is a sterically hindered electrophile. This property results in preferential reactivity of the trityl 

75 halide species with less hindered nucleophiles. In oligonucleotide synthesis, the 4,4'-d jmethoxytri phenyl - 
methyl-(4,4'-dimethoxytrityl) group and related 9-phenylxanthene-9-yl (more commonly known as pixy I) 
group are commonly used for the regioselective protection of the 5'-hydroxyl group of ribonucleoside and 
2-deoxyribonucleoside monomers. Characteristics such as preferred acid lability, hydrophobic character 
and 5'-regioselectivity make these two triphenylmethyl derivatives the protecting groups of choice in 

20 oligonucleotide synthesis. 

Alkoxy substitution of the phenyl rings is commonly used to increase the acid lability of the 
triphenylmethyl protecting group. Detailed investigation of trityl derivatives and the effects of substituents on 
acid lability have been performed. (Taunton-Rigby, A. et aj., J. Org. Chem. 37:956-984 (1972). Appropriate 
substitution affords the 4 ( 4'-dimethoxytriphenymethyl group and the 9-phenylxanthene-9-yl group of optimal 

25 acid lability for current oligonucleotide synthesis applications. Additionally, several acid stable trityl deriva- 
tives which retain the desirable hydrophobic character and 5'-regioselectivity have been prepared, such as 
4,4'.4--tris(4,5-dichlorophthalimido)-trityl (Sekine, M. and T. Hata, J. Am. Chem. Soc. 1 08:5764-5765 (1984)); 
4,4\4*-tris(levulinyloxy)trttyl (Sekine, M. and T. Hata. Bull. Chem. Soc. Jpn. , 58:336-339 (1985)); 4-(9- 
fluorenylmethyloxycarbonyl)oxy-4'.4*-dimethoxytrityl (Happ, E. and C.S. Happ. Nucleosides and Nucleotides 

30 7:813-816 (1988)); and 4-(9-fluorenylmethyloxycarbonyl)amino-4',4*-dimethoxytrityl (ibid. ). All these com- 
pounds are substituted triphenylmethyl derivatives containing protected phenol(s) or protected exocyclic 
amino group(s). While the phenol(s) or exocyclic amino group(s) remain protected, the trityl ether bond is 
fairly stable to acidic conditions. Upon hydrazinolysis of the levulinyl protecting groups, hydrazinolysis of 
the 4,5-dichlorophthalimido protecting groups or alkali catalyzed beta-elimination of the 9-fluorenylmethylox- 

35 ycarbonyl (Fmoc) group, the phenolic group(s) or exocyclic amino group(s) were all regenerated. The ether 
bond of the resulting trityl species can then be rapidly cleaved with mild acid. Trityl derivatives which have 
been previously described served strictly as protecting groups having unusual lability characteristics. 

Triphenylmethyl protecting groups with long chain alkyl substituents have been prepared as tools for 
affinity chromatographic purification of oligonucleotides. Seliger. H. and H.H. Gortz, Angew. Chem. 93:709 

40 (1981); Seliger, H. and H.H. Gortz, Angew, Chem. Inter. Ed. Engl. 20:683 (1981); Kwiatkowski, M. et al. . 
Acta. Chem. Scand. §38:657 (1984); Schmidt, G. et al.. Nucleosides and Nucleotides 7:795-799 *(1 988). By 
introducing triphenylmethyl protecting groups bearing long chain alkyl substituents to the 5'-hydroxyl 
terminus of oligodeoxynucleotides, stronger affinity of the full length DNA products for hydrophobic 
chromatographic supports can be achieved. Such derivatives are particularly useful for the purification of 

45 oligonucleotides of greater than sixty nucleotides in length. However, preparation of the long chain alkyl 
substituted triphenylmethyl derivatives and the four suitably protected synthetic monomers is difficult and 
labor intensive. These monomers are used for a single condensation reaction in an oligonucleotide 
synthesis. 

In recent years, non-isotopic labeling of oligodeoxynucleotides utilizing biotin and fluorophores has 
so become increasingly useful for the detection of DNA immobilized on solid supports (Beck, S. et aj.-, Nucl. 
Acids Res. 17:5115-5123 (1989)); Takahashi, T. et aj., Anal. Biochem. 179:77-85 (1989)); the immobilization 
of DNA to solid supports (Syvanen, A. et al., Nucl. Acids Res. 16:11327-11338 (1988)); Richardson R.W. 
and Gumport, R.I., Nucl. Acids Res. V[:61 67-61 84 (1983)); the affinity purification of DNA (Mitchell, L.G. and 
Merril, C.R., Anal. Biochem. 178 , 239-242 (1989)); Dawson, B.A. et aj., J. Biol. Chem. 264 . 12830-12837 
55 (1989)); and/or the sequencing of DNA (Beck, Ibid; Mitchell, et aj., Ibid.; Smith, L.M., Nature 321:674-679 
(1986)). Biotin and fluorophores have been incorporated into synthetic nucleic acid fragments by numerous 
chemical methods (Agrawal, S. et ah, Nucl. Acids Res. 14:6227-6245 (1986); Forster, A.C. et aj., Nucl. Acids 
Res. 13: 745-761 (1985); Coull. J.M. et al., Tett. Lett. 27: 3991-3994 (1986); Gibson, K.J. and Benkovic, S.J.. 
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Nucl. Acids Res. 15:6455-6467 (1 987). 

Chemical assembly of oligonucleotides is well established and has been performed by phosphodiester, 
phosphotriester. phosphoramidate. phosphoramidite and H-phosphonate methods. See Gait, M.J., 
Oligonucleotide Synthesis, A Practical Approach , IRL Press Inc., Oxford, England. The most common 

5 means by which oligonucleotides are produced is referred to as solid phase synthesis using the 2- 
cyanoethylphosphoramidites (Sinha, N.D. et a}., Nucleic Acids Research , 12:4539-4557 (1984); and U.S. 
Patent No. 4,725,677, issued to Millipore Corporation). Several suitably protected amino groups containing 
nucleoside derivatives (Haralambidis, J., et a}., Nucl. Acids Res. 15:4857-4876 (1987); Gibson, K.J., Ibid. , 
Smith, L.M. et aK. Nucl. Acids. Res. 13:2399-2412 (1985)), thiol group containing nucleoside derivatives 

70 (Sproat, B.S., et aj., Nucl. Acids Res. 15:4837-4848 (1987)) and 5'-terminal linkers (Agrawal, S., Ibid. ; Coull, 
J.M. et ah, Ibid. ; Connolly, BA, Nucl. Acids Res. 15:3131-3139 (1987); Blanks, R. and McLaughlin, L.M., 
Nucl. Acids Res. 16:2659-2669 (1988); Connolly. B.A. and Ridge, P., Nucl. Acids Res. 13:4485-4502 (1985)) 
have been prepared for DNA labeling applications. All of these synthons are easily incorporated into 
oligodeoxy nucleotides during chemical assembly. All of these linkers can be further reacted to introduce a 

75 label into the oligonucleotide. The linkers, however, cannot be removed to generate a natural (unmodified) 
oligonucleotide. 

The polymerase chain reaction (PCR) (U.S. Patent No. 4,683,202 issued to Cetus Corporation) is a 
method used to exponentially amplify a nucleic acid sequence in vitro . The method uses two short 
oligonucleotide primers which are complementary to different strands of a DNA template and flank the 

20 region of the nucleic acid sequence to be amplified. Using a thermostable DNA polymerase and repeated 
cycles of template denaturing, primer annealing and primer extension, it is possible to rapidly prepare large 
quantities of a defined nucleic acid sequence from a few or even a single template copy. However, the 
flanking sequences must first be determined in order to prepare complementary primers. 

Primers used in PCR will be incorporated at the 5 '-hydroxyl terminus of the amplified products. If one 

25 (or both) of the primers are labeled in such a manner as not to interfere with primer annealing and primer 
extension, it is inevitable that the amplified product will contain the label. It would be desirable to chemically 
assemble oligonucleotide primers in which the label can be later removed to yield unmodified DNA. 

The immobilization of PCR amplified products to a solid support has been performed by using biotin- 
labeled primers and a streptavidin agarose support (Mitchell, L.G., et al „ Anal. Biochem. 1 78:239-242 

30 (1989)). When a single labeled primer is used, it was possible to denature the immobilized double-stranded 
product and isolate the single-stranded sequence extended from the unlabeled primer. However, the affinity 
of the biotin-streptavidin complex is so great (Ko is reported to be 10~ 1S ) that it is essentially impossible to 
remove the biotin-labeied strand from the support, thus interfering with isolation of the double-stranded 
product. Moreover, based on the limitations afforded by current oligonucleotide synthesis and labeling 

35 chemistries, it would be desirable to have a method in which biotin can be cleaved from the amplified DNA 
so that the unmodified double-stranded product can be obtained. 

. Summary of the Invention 

40 This invention pertains to heterobi- or oligo-functional protecting groups in which at least two func- 
tionalities can regioselectively or chemoselectively bind to functional groups on a natural product, 
biopolymer or synthon for a natural product or biopolymer (such as, an oligonucleotide, nucleic acid, 
nucleoside, nucleotide, amino acid, monosaccharides, oligosaccharides, peptides, proteins, carbohydrates, 
lipids, steroids or alkaloids). The protecting groups provide a means to reversibly attach a modifying group 

45 to the natural product, biopolymer or synthon. The biopolymer, natural product or synthon can be regene- 
rated in original form by simply removing the protecting group. In general, the protecting group comprises a 
regioselective or che mo selective functionality and one or more additional functionalities which can bind to a 
modifying moiety, selected from the group consisting of a detectable label, a biologically active molecule or 
a compound for aiding in the purification of the natural product, biopolymer or synthon, provided that the 

50 functionality is not a phenol (hydroxyl) group or an arylamino group and the modifying moiety is not 9- 
fluorenylmethoxycarbonyl (Fmoc), levulinoyl or 4,5-dichlorophthalimido. Preferably, the protecting group is a 
triphenylmethyl derivative, such as 4,4 , -dimethoxytriphenylmethyl or pixy I. 

The invention also pertains to reversibly modified natural product, biopolymer and synthon which can 
be represented by the formula: 

55 

L-P-C* 

wherein C* is a natural product, biopolymer or synthon for a biopolymer or natural product; 
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P is a protecting group bound to a functional group of C* which can be removed from the protected 
functional group under conditions such that the functional group is regenerated; and 

L is a functionality for bonding a modifying moiety to P. provided that L is not a phenol (hydroxyl) group 
or an arylamino group. 

5 The heterobi- or oiigofunctional protecting groups of this invention can be used as a vehicle for 
reversibly attaching a natural product, biopolymer or synthon to the modifying moiety. Although one or 
more functionalities on C are protected by P, one can perform a variety of chemical manipulations. For 
example, single-stranded and double-stranded nucleic acids can be purified and subsequently isolated in 
their natural form. Likewise, several natural products can be simultaneously purified and separated from 

io each other using the methods and compounds described herein. In one embodiment, modified 
oligonucleotides can be used as primers for polymerase chain reactions. 

Brief Description of the Drawings 

75 Figure 1 is a schematic representation of the synthesis of a modified tri phenyl methyl protecting group 
and a nucleoside protected therewith. 

Rgure 2 is an high performance liquid chromatography (HPLC) analysis of crude partially protected 
fluorescein labeled polymerase chain reaction (PCR) primer 1. 

Rgure 3 is is a reversed phase HPLC analysis of a crude mixture comprising seven partially protected 
20 oligonucleotides. 

Rgure 4 is an ethidium bromide stained electrophoretic gel obtained from the analysis of various 
samples. 

Detailed Description of the Invention 

25 

This invention pertains to compounds which serve as both a protecting group for protection of a 
functional group on a natural product, biopolymer or synthon for a natural product or biopolymer; and a 
linking group for attaching a modifying moiety thereto. The compounds of this invention are heterobi- or 
oiigofunctional protecting groups in which at least one functionality can regtoselectively or chemoselectively 

30 bind to a functional group of the natural product, biopolymer or synthon. They, however, can be removed 
from the protected functional group under conditions such that the original functional group is regenerated. 

Heterbi- or oiigofunctional protecting groups can be represented by the formula L-P, where P is a 
protecting group for protection of a functional group on the natural product, biopolymer or synthon. For 
example, P can be used to protect hydroxyl, amino or thiol functionalities. The protecting group is capable 

35 of being removed from the protected functional group under conditions such that the original functional 
group is regenerated. Preferably, P is a regioselective protecting group, such as trityl, 4-monomethoxytrityl, 
4,4'-dimethoxytrityl or pixyl. However, other protecting groups can be used depending upon the functionality 
on the natural product, biopolymer or synthon to be protected. P can also be benzyl (e.g., methoxybenzyl, 
nitrobenzyl, alkoxy benzyl, dialkoxy benzyl); benzyloxycarbonyl (e.g., methoxylbenzyloxycarbonyl); benzylox- 

40 ymethyl (e.g., nitrobenzyloxymethyl, methoxybenzyloxymethyl); alkoxycarbonyl (e.g., t-butoxycarbonyl); 
alkoxymethyl (e.g.. methoxymethyl); alkylsilyl (e.g., t-butyldimethylsilyl); arylsilyl (e.g., triphenylsilyl); ben- 
zoyl (e.g., nitrobenzoyl. alkoxybenzoyl); phenoxyacetyl; or alkoxyacetyl. Each of these protecting groups 
can be substituted. For example, a preferred protecting group for peptide synthesis is 9-fluorenylmethylox- 
ycarbonyf. 

45 L is a functionality on the protecting group for bonding or linking a modifying moiety thereto, provided 
that L is not a phenyl (hydroxyl) group or an arylamino group. Reaction, extension, additional functional 
group incorporation or labeling can be performed at this site since L is a reactive group. 

This invention also pertains to reversibly modified natural products, biopolymers or synthons which can 
be represented by the formula: L-P-C* where C* represents a synthetic biopolymer, synthon, natural product 

so or any modification of these. C* can be a nucleoside, nucleotide, oligonucleotide, nucleic acid, amino acid 
peptide, protein, monosaccharide, oligosaccharide, carbohydrate, steroid, lipid or alkaloid. Attachment of a 
modifying moiety to L results in a compound represented by the formula M-L-P-C, where M is the 
modifying moiety. Upon subjecting the product M-L-P-C* to conditions suitable to remove the protecting 
group, the original deprotected functional group of compound C* will be generated along with the biproduct 

55 M-L-P. 

The modifying moiety M represents either the original modifying moiety or any modification thereof. M 
can be a means for detecting and/or purifying C*. such as by affinity or non-affinity purification methods. M 
can also serve as a means for attaching C* to a solid support. Accordingly, M can be an alky! moiety, a 
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hydroxyalkyl moiety, a carboxyaJkyl moiety, an aminoalkyl moiety, a thioalkyl moiety, a detection label 
(such as a radioisotope, fluorophore, luminescent compound, che mi luminescent compound or biotin), an 
affinity or purification handle (such as long alkyl chains, polymers or biological molecules), a biologically 
active molecule (such as a peptide, a protein, a nucleoside, a nucleotide, an oligonucleotide, a nucleic acid, 

5 a sugar molecule, an oligosaccharide, carbohydrate, steroid, lipid or alkaloid) or a polymer, provided that M 
is not 9-fluorenylmethoxycarbonyl (Fmoc), levulinoyl or 4,5-dichlorophthalimido. For example, when the M is 
a detection label, it is possible to detect the complex M-L-P-C by an appropriate means. Deprotection 
regenerates the compound C" and the labeled biproduct M-L-P. Removal of the protecting group may or 
may not be performed after the separation, isolation, detection and/or purification of the complex. 

w When M is a compound for aiding in the purification or immobilization of C* (such as a purification or 
affinity handle) it is possible to pass the complex M-L-P-C" over a solid support to selectively adsorb or 
covalently attach the complex to the support for the purpose of immobilization and/or purification. In the 
case where the complex M-L-P-C* can be eluted from the support, it is possible to collect the purified 
product. Removal of the protecting group under predefined conditions will generate C* and the biproduct M- 

75 L-P. When the complex M-L-P-C" is strongly immobilized to the support via the modifying moiety M, the 
new species will be defined as S-M-L-P-C* where S is defined as the support. C* can then be separated 
from the biproduct S-M-L-P under conditions known to remove the protecting group, such as by acid 
treatment. 

One or more chemical or enzymatic manipulations of compound C* of the complex M-L-P-C", can be 
20 performed to prepare a modified compound defined as C*. Although C* has been previously defined to 
include modified versions of C*. for illustration purposes C~ represents one or more modifications. The 
resulting complex, defined as M-L-P-C*, is capable of yielding C* and a biproduct M-L-P upon exposure of 
the complex to conditions sufficient to remove the protecting group. Similarly, one or more chemical or 
enzymatic manipulations can be made to M of complex M-L-P-C*. Such manipulations can be made prior to 
25 or after C* is chemically or enzymatically manipulated. Likewise, M-L-P can be removed from C" or CT upon 
condition dependent removal of the protecting group. 

A preferred protecting group is a triphenylmethyl derivative which is modified such that it can bind to C* 
for protection thereof and to a modifying group (such as a label) at one or more other functional groups on 
the phenyl moieties. Generally, heterobi- or oligofunctional triphenylmethyl protecting groups of this 
30 invention are represented by the formula: 



35 




L 



40 



45 For example, when C* is a nucleoside, nucleotide or oligonucleotide, the triphenylmethyl protecting group 
(P) can be bound to C" at the 5'-hydroxyl group, the nucleoside base or the 3'-hydroxyl group. Preferably, P 
is attached to the 5'-hydroxyl group. C* can be cleaved from the complex at the protecting group to thereby 
yield the original unprotected biological compound. Thus, biological compounds of interest can be removed 
from labels or other modifying moieties without altering their structures due to the removal procedure. 

so Preferred triphenylmethyl derivatives and methods for making them are described in detail below. 

In one embodiment, a substituted triphenythydroxy methyl derivative containing a single (or multiple) 
exocyclic carboxylic acid, sulfonic acid, cyano, nitro or other functional group(s) of the general formula (la) 
is prepared. Likewise, a substituted triphenylhydroxymethyl derivative containing a single (or multiple) 
exocyclic alkyl carboxylic acid, alkyl sulfonic acid, cyanoalkyl, nitroalkyi or other alkyl functional group(s) of 

55 the general formula (la) is prepared. 
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5 



10 




la 



wherein A1-A15 are the same or different and are selected from the group consisting of H, R, OR and Z, 
provided that there is at least one Z group; 

20 R is an alky) having one to 20 carbon atoms (e.g., methyl, ethyl, 2-propyl, butyl, t-butyl, 2-cyanoethyl), 
which is optionally substituted by one or more heteroatoms (such as a cyano, nitro or halo group; or a 
substituted or unsubstituted aryl (e.g., phenyl t-butyl phenyl); and 

Z is -(CH2) n C(0)OH t -(ChbJnSOaH, -(CH^NCfe, -(CH^CN, -(CH^OH, -(CH 2 )„NH 2 or -(CH 2 ) n SH where 
n is an integer from zero to 20; provided that when n is zero, Z is a group other than -OH or -NH2. 

25 A preferred derivative of the above compound is a 4' ,4"-dimethoxytriphenyihydroxy methyl shown in 
Figure 1 (compound lb). According to Figure 1, the compound can be synthesized by reacting a Grignard 
reagent (prepared from 2-(4~bromophenyl)-4,4-dimethyl-1 ,3-oxazoline and magnesium) with a substituted or 
unsubstituted benzophenone to produce a substituted or unsubstituted triphenylmethyl derivative. This 
compound is then sequentially treated with aqueous acid, base and acid to yield the compound lb. 

30 The exocyclic functional group(s) of the compounds of the general formula (la) is converted to a 
compound of general formula I la. In one embodiment, the exocyclic functional group is converted to a 
compound having an exocyclic electrophilic functional group, such as N-hydroxysuccinimidyl ester, 2- 
nitrophenyl ester, 4-nitrophenyl ester, 2,4-dichlorophenyl ester, any active ester, an acyl halide, acyl azolide, 
alkyl halide, a sulfonyt halide or any reactive halide derivative. Preferably, the compound is N-succinimidyl- 

35 4-[(bis(4-methoxyphenyl)-hydroxyrnethyl]benzoate (lib) shown in Figure 1 . 




55 wherein AVA'is are the same or different and are selected from the group consisting of H. R, OR, Z 
and L, provided that there is at least one L group; 
R and Z are defined above; 

L is -(CH 2 ) n C(0)W f -(0^)0802 W, -(CH 2 ) n W. where n is an integer from zero to 20; and 
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W is selected from the group consisting of CI. Br, I, -NCS, -NCO, 



s 




10 

and D1-D5 are the same or different and from H. F, CI, Br. I. -NO2 and -CN. 

A substituted triphenylhalomethyl or triphenyltetrafluoroboranomethyl derivative containing an exocyclic 
reactive site(s) of the genera! formula Ilia can then be prepared from compounds of the general formula Ha. 
For example, a triphenylhalomethyl or triphenyltetraftuoroboranomethyl derivative containing at least one 

15 additional exocyclic electrophilic functional group can be prepared. These compounds are regioselective 
dielectrophilic triphenylmethyl derivatives. Preferable exocyclic electrophilic functional groups include N- 
hydroxysuccinimidyl ester, 2-nitrophenyl ester, 4-nitrophenyl ester, 2,4-dichlorophenyl ester, any active 
ester, an acyl halide. acyl azolide, alkyl halide. a sulfonyl halide or any reactive halide derivative. In a 
preferred embodiment, the compound is N-succinimidyh4-(bis(4-methoxyphenyl)-chloromethyl]- benzoate 

20 (lllb) shown in Figure 1. 



25 



30 




Ilia 



40 wherein A'i-A'15 are defined above and X is a leaving group, such as CI, Br, I or BF*. 

Regioselective protection of a functional group on a natural product, biopolymer or synthon for a natural 
product or biopolymer is achieved by preferential reactivity at the stericaily hindered cationic site of 
compounds having the general formula Ilia, to form a compound of general formula IVa. For example, the 
5'-hydroxyl group of a ribonucleoside or ^-deoxyribonucleoside is preferentially protected with the triphenyl- 

45 methyl derivatives of general formula Ilia, to yield partially protected nucleoside derivatives having the 
general formula IVa. 



7 



EP 0 424 819 B1 



10 




is 



IVa 



wherein AVA'is are defined above, and C* is selected from the group consisting of a nucleoside, 
nucleotide, oligonucleotide, nucleic acid, amino acid, peptide, protein, monosaccharide, oligosaccharide, 
20 carbohydrate, lectin, lipid, steroid, alkaloid, and a biopolymer. 

Preferred compounds of the formula IVa are N-succinimidyl-4-{bis(4-methoxyphenyl)-5'-0-(2'-deox- 
yribonucleosidyl)-methyl]-benzoate (IVb1-4; Figure 1) and N-succinimidyl-4-[bis(4-methoxyphenyl)-5'-0- 
(ribonucleosidyl)-methyl}-benzoate. These compounds can be represented by the formula IVc. 



25 



30 



35 



CHjO 



OCH, 




40 wherein B is a nucleoside base which may be protected by a base protective group which can be 
eliminated (KCster, H. et al ., Tetrahedron 37: 363-369 (1981), and can be selected from the group consisting 
of: 



45 



50 



55 
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O 

HN ' 



Q 



10 





t l OCX 1^- <Ia £ 



20 



o o o 



25 



HN ^R HN . HN ^0R 



30 Q is H or methyl; 
G is H f OH, OR. 



R is defined above. 

40 To prepare fully protected nucleoside derivatives, the ff-hydroxyl group of the partially protected 
nucleoside derivatives of general formula IVc are reacted with an activated phosphorus containing com- 
pound. Fully protected nucleoside derivatives which result from this reaction are represented by formula Va. 
Compounds of this formula are suitable for subsequent condensation with a hydroxyl group under the 
conditions commonly used for oligonucleotide synthesis. Activated phosphorus compounds which can be 

45 used include but are not limited to 2-cyanoethylphosphoramidites (U.S. Patent No. 4,725,677), O-methyl 
phosphoramidites (U.S. Patent No. 4,458,066). H-phosphonate synthons. phosphotriester synthons, and 
phosphodiester synthons. Two examples of fully protected nucleoside derivatives are N-succinimidyl-4-[bis- 
(4-methoxyphenyl)-5'-0-(3'-0-(N,NKiiisopro^ 

benzoates and N-succinimidyl^[bis(4-methoxyphenyl)-5'-0-(3'-0-(N,N-diisopropylamino-2-cyanoethyl- 
50 phosphinyl^-deoxyribonucleosidylhmethylhbenzoates shown in Figure 1 (compound Vbi-*). 



55 
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10 



is 



20 



25 




K G 



OCH, 



Va 



wherein K is H, OH, 



1 ill i 1 

o o o o o o 



1 OH CH | I OH OR 



R 



and R, B and G are defined above. 

Fully protected compounds such as those shown in Figure 1 can be further extended prior to attaching 

30 a modifying moiety to the protecting group. For example, these compounds can be condensed with the 5'- 
hydroxyl group of a partially protected and assembled oligonucleotide, either in solution or by solid phase 
methods, to prepare fully protected oligonucleotides having a reactive linking group(s) (L) on the protecting 
group (P). The protecting group, which bears the reactive linking group(s) (L), is thereby attached to the 5'- 
hydroxyl terminus of the oligonucleotide. 

35 Protected compounds of the general formula L-P-C* can be modified at the reactive functionality (L) of 
the protecting group/linking group P-L The product of said modification has been previously defined as M- 
L-P-C", where M is the modification produced by one or more manipulations at the reactive functional group 
(L) of the protecting group or any manipulations of an original modification. 

In one embodiment, the compound C* is a nucleoside, an oligonucleotide or a nucleic acid and the 

40 . protecting group/linking group moiety P-L is a triphenylmethyl derivative of formula Ilia. In another 
embodiment, compound C* is a support bound oligonucleotide attached to a 4,4 , -dimethoxysubstituted 
triphenylmethyl protecting group/linking group P-L (preferred embodiment of the general formula lllb) by an 
acid labile ether bond. M is a group as previously defined. 

Compounds of the general formula M-L-P-C" can be deprotected to generate C" and the biproduct M-L- 

45 P. In the case where C* is an oligonucleotide, it is possible to further manipulate compound C* of the 
complex M-L-P-C*. In one embodiment, the C* is a fully protected and support bound oligonucleotide, P-L is 
a heterobifunctional triphenylmethyl derivative of the general formula Ilia (the oligonucleotide is protected by 
P as an acid cleavable ether), and M is a modifying moiety which can include the groups previously 
described and which may or may not be protected. The complex M-L-P-C* can be cleaved from the support 

so and partially deprotected using methods commonly employed in oligonucleotide synthesis. In a final step, 
the modified protecting group linker P-L, can be removed to generate a fully deprotected oligonucleotide 
and a biproduct having the general formula M-L-P. 

Partially protected oligonucleotides can undergo several other manipulations prior to complete de- 
protection. These include further manipulations of M and/or immobilization of the oligonucleotide to a 

55 support via the modification M. In a preferred embodiment* P-L is a heterobifunctional triphenylmethyl 
derivative of the general formula Ilia (the oligonucleotide is protected by P as an acid cleavable ether), and 
M is a hydroxyalkyl moiety, a carboxyaJkyl moiety, an aminoalkyl moiety or a thioalkyl moiety. In another 
preferred embodiment, C* is a partially protected oligonucleotide. P-L is a heterobifunctional triphenylmethyl 
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derivative of the general formula 1Mb (the oligonucleotide is protected by P as an acid cleavable ether), and 
M is an aminoalkyl moiety. 

According to the method, an oligonucleotide can be labeled via the hydroxyl group, carboxyl group, 
amino group or thiol group of the modification M. Suitable detection labels include a radioisotope, 
5 fluorophore, luminescent compound or chemiluminescent compound, biotin, an affinity or purification handle 
(such as long alky I chains or polymers) or a biologically active molecule (such as a peptide, a protein, a 
nucleoside, a nucleotide, an oligonucleotide, a nucleic acid, a monosaccharide, an oligosaccharide, steroid, 
lipid, alkaioid or a carbohydrate). Preferably, the detection label is biotin or a fluorophore. 

The complex M-L-P-C* can also be immobilized onto a support via M. For example, M can be a 

10 carboxyalkyl moiety, an aminoalkyl moiety or a thioalkyl moiety. The carboxyl moiety of the complex (where 
M is a carboxyalkyl moiety) can be activated by a water soluble carbodiimide and coupled to a support 
containing nucleophiles. Likewise, the amino-containing complex (where M is an aminoalkyl) can be 
covalentiy immobilized to supports containing electrophilic chemical functional groups, such as 
isothiocyanate, isocyanate, acyl halide, sulfonyl halide, acyl imadozolide, acyl N-hydroxysuccinimide, alky I 

75 halide and active ester containing supports. The thiol-containing complex (where M is a thioalkyl moiety) 
can be immobilized by reaction with the above-described chemically reactive supports or by passing it over 
a commercially available mercury containing support as has been previously described for thiol-containing 
oligonucleotides (Blanks, B. et al.. Nucj. Acids. Res. 16:10283-10299 (1988)). 

An advantage to immobilization of a complex containing the triphenylmethyl protecting group/linking 

20 group P-L to a support is that products can be selectively adsorbed to the support and subsequently 
cleaved from the protecting group. Upon adsorption of the complex, all unbound impurities can be washed 
from the support. Compound C* can then be removed from the support in pure form under conditions in 
which the complex M-L-P remains immobilized (adsorbed) to the support. 

In a preferred embodiment, M is biotin attached to the triphenylmethyl protecting group/linking group 

25 (P-L) via a spacer molecule, such as an alky I chain. Attachment of biotin to M of the protected 
oligonucleotide can occur after partial deprotection and cleavage from the solid support; however, it is 
preferable to perform the biotin ylation while the oligonucleotide is support bound. The resulting biotinylated 
complex can be efficiently immobilized to a commercially available avidin or streptavidin support as has 
been previously described (Coull. J.M. et al.. Tett. Lett. 27:3991-3994 (1986)). 

30 In one embodiment of this invention, two or more compounds of interest can be simultaneously 
separated and purified from a mixture comprising two or more compounds of interest and impurities, and 
from each other. This purification method is herein referred to as multiplex purification. According to the 
method, compounds of the general formula L-P-C* can be prepared as previously described. For illustration 
purposes, C can be defined as a fully protected oligonucleotide A (L-P-A), and a fully protected 

35 oligonucleotide B (L-P-B). Preferably, the protecting group/linker P-L is a triphenylmethyl derivative of the 
general formula Ilia (the oligonucleotide is protected by P as the 5'-terminal trityl ether). White support 
bound, the reactive site L of the 5-terminaJ protecting group P of each of the fully protected 
oligonucleotides (L-P-A and L-P-B) can be further modified. For simplicity, the modifications are defined as 
Mi and M2 respectively, so as to prepare compounds of the general formula Mi -L-P-A and M2 -L-P-B. Mi 

40 and M 2 are affinity or purification handles having an affinity for a support S, such as alkyl chains of differing 
carbon lengths. Preferably, the affinity or purification handle M2 should have a significantly greater affinity 
for the support S than does the affinity or purification handle Mi . In the optimal case, the modification M 2 is 
a significantly longer alkyl chain than is the modification Mi . Additionally, the components of the mixture 
and compounds A and B should have little or essentially no affinity for the support S. 

45 Reversed phase high performance liquid chromatography (HPLC) is typically used for the purification of 
partially protected oligonucleotides. In principle, the triphenylmethyl group of the partially protected and 
fully assembled products will interact more strongly with the hydrophobic stationary phase (commonly C18 
coated silica) than do the impurities. The terminated sequences and other impurities will, therefore, elute 
quickly from the column so that the partially protected oligonucleotide product will be well separated and 

so can be collected as it e lutes from the column. See McLaughlin, L.W. and Piel, N., Oligonucleotide 
Synthesis, A Practical Approach (1984), Gait, M.J. (ed) IRL Press Inc.. Oxford, England, pp. 199-218. 

When the mixture comprising and partially protected oligonucleotides Mi -L-P-A and M2 -L-P-B and 
impurities, is subjected to reversed phase HPLC separation conditions, the components will elute in their 
respective order and can be collected, each in purified form. Once separated, the protecting groups can be 

55 removed to generate the fully protected and purified oligonucleotides A and B and the biproducts of general 
formula P-L-M1 and P-L-M2. 

Compounds of this invention can also be used in polymerase catalysed extension reactions. According 
to the method, an oligonucleotide (e.g., 10-30 nucleotides in length) is prepared such that it is complemen- 
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tary to and, therefore, will hybridize to, a single-stranded nucleic acid template. Once hybridization occurs, a 
DNA polymerase will extend the primer using the nucleic acid as a template to prepare the complementary 
nucleic acid sequence. Labeled primers can be used in polymerase catalyzed extension reactions provided 
that they are not inhibited from annealing to a template by virtue of the attached label and provided that the 

5 functional group from which the extension reaction proceeds is not made inaccessible by the incorporation 
of the label. A preferred variation of the polymerase extension reaction is the polymerase chain reaction 
(PCR; U.S. Patent 4,683,202). Polymerase catalyzed primer extension reactions of the 3'-terminal hydroxyl 
group can also be performed. 

The product of a primer extension reaction can be manipulated by any of the previously described 

to methods. For example, if the modification M is a detection label, it is possible to detect the complex by an 
appropriate means, such as by fluorescence, radiolabeling, chemtluminescence, or luminescence. If the 
complex is an affinity handle such as biotin, it is possible to immobilize it to an avidin or streptavidin 
support. The completely unmodified double-stranded product can likewise be generated at any time by 
removal of the protecting group. Additionally, a double-stranded DNA product of nucleic acid synthesis can 

is be denatured to prepare single stranded DNA. 

The polymerase chain reaction requires two primers which are complementary to different strands of a 
nucleic acid template and flank the region of the template to be amplified. As with any primer extension 
reaction, the product of PCR is a double-stranded nucleic acid. Modified oligonucleotide primers of the 
general formula M-L-P-C* can be used as amplification primers in the PCR process. Partially protected 

20 double-stranded nucleic acids generated by primer extension can be further modified according to the 
methods previously described. If two (2) labeled oligonucleotide primers are used in the polymerase chain 
reaction, the double-stranded product will contain labels on the 5' ends of the different strands. This product 
can also undergo all the same reactions described herein for any partially protected oligonucleotide, 
including condition dependent removal of the protecting groups to generate ah unmodified double stranded 

25 nucleic acid. 

The invention will be further illustrated by the following non-limiting Examples. 



30 Synthesis of 2-[4-(bis-(4-methoxyphenyl)-hydroxymethyl)-phenyll-4,4-dimethyloxazoline 

To 350 mmol of 2-(4-bromophenylM.4-dinnethyl-1 ,3-oxazoline (A. I. Meyers et aL, J. Amer. Chem. Soc. 
92: 6646-6647 (1970)) dissolved in 1 liter of freshly distilled tetrahydrofuran was added 17.0 g (700 mmol) 
of magnesium. The solution was warmed until the reaction initiated and thereafter was heated gently for 1 

35 nr. An extra one liter of tetrahydrofuran was added at this time. 345 mmol of 4,4'-dimethoxy-benzophenone 
was added and the coupling reaction stirred for 2 hrs at gentle reflux. The reaction was filtered and 
concentrated to approximately 500 ml. The solution was poured slowly into a flask containing 8% aqueous 
KHSO4 (1L) and diethyl ether (1L). The layers were separated and the organic fraction was washed once 
with H 2 0, dried (MgSO*), filtered and evaporated to yield 147.2 g of orange oil. The purified product was 

40 obtained by crystallization from benzene or ethyl acetate. 105.4 g (252 mmol, 73%) It. green crystal mp. 
140-142 'C 1 H-NMR (CDCb): 5 = 1.36 (s, 6H, -CH 3 ); 2.82 (S, 1H, O-H); 3.79 (s. 6H, -OCH3); 4.09 (s. 2H, 
CH 2 ); 6.80-7.17 (dd, 8H, CH 3 0-Ar-H); 7.32-7.89 (dd, 4H, oxaz-Ar-H) 



To 200 mmol of 2-[4-(bis-methoxyphenyl)-hydroxymethyl)-phenylH i 4-dimethyloxazol!ne was added 400 
ml of 80% aqueous acetic acid and the solution was stirred at 60-70 *C for 6-7 hrs. The product was 

so concentrated to an orange oil and redissolved in 500 ml of 20% NaOH in ethanol/water, 1/1 (v/v). This 
solution was refluxed vigorously for one hour and then concentrated to a white semi solid. The residue was 
dissolved in 1 L of H 2 0 and acidified with 3 M HC1 to pH 1.0. The solid was filtered off and dissolved 750 
ml of ethyl acetate. This solution was washed two times with dilute acid, dried (MgSO*) filtered and 
evaporated to yield 81 2. g of an orange foam. The crystalline product was obtained from benzene. 

55 67.49 g (185 mmol, 93%) orange crystalline solid mp. 105-1 07 *C 



'H-NMR (CDCI3): a = 3.80 (s. 6H, OCH 3 ); 6.80-7.20 (dd, 8H, CH 3 0-Ar-H); 7.42-8.06 (dd, 4H, HOOOAr-H) 



Example 1 



Example 2 



45 



Synthesis of 4-cartx>xy-4 / -4*-dimethoxytriphenylhydroxymethane (Figure 1 ; Compound lb) 
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Example 3 

Synthesis of N-succinimidyl^[bis(4^ethoxyphenyl)-hydroxymethyl}-ben2oate (Figure 1; Compound Mb) 

s To 6 mmol of lb and 9 mmol of N-hydroxysuccinimide (NHS) in 10 ml of ethyl acetate stirring at 0*C 
was added 7.5 mmol of N.W-dicyclohexylcarbodiimide (DCC). The reaction was stirred at 0"C for 2 hrs, 
then an additional 0.5 mmol of DCC and 1 mmol of NHS was added. After 4.5 hrs at 0 • C the reaction was 
filtered and diluted with ethyl acetate to a volume of 50 ml. This solution was washed with water four times, 
5% aqueous NaHCOa once and again with water. The organic layer was dried (MgSO*), filtered and 

10 evaporated to yield 70.0 g of yellow foam. The purified product was obtained by crystallization from ethyl 
acetate. 

2.01 g (4.4 mmol, 73%) white solid (two crystalline forms) mp. 168-1 70 *C or 186-1 88 *C 

'H-NMR (CDCI 3 ): 6 = 2.77 (s, 1H, OH); 2.90 (s. 4H, CH 2 ); 3.81 (s. 6H, OCH 3 ); 6.82-7.17 (dd, 8H, CH 3 OAr- 

H); 7.48-8.10 (dd. 4H, succinimidyl-Ar-H) 

75 

Example 4 

Synthesis of N-succinimidyl-4-[bis-(4-methoxyphenyl)-chloromethylhbenzoate (Figure 1; Compound lllb) 

20 To 50 mmol of lib was added 250 ml of acetyl chloride. The solution was boiled for three hours and 
cooled. 350 ml of anhydrous diethyl ether was added and the mixture was placed overnight at 5 • C. The 
white crystalline product was collected by vacuum filtration. 21.35 g (44.5 mmol, 89%) white crystalline 
solid mp. 203-204 * C 

'H-NMR (CDCfe): B = 2.91 (s. 4H, CH 2 ); 3.82 (s. 6H, OCH 3 ); 6.81-7.17 (dd, 8H. CH 3 0-Ar-H); 7.42-8.10 (dd. 
25 4H, succinimidyl-Ar-H) 

Example 5 

General procedure used to prepare N-succinimidyl-4-[bis-(4-methoxyphenyl)-5 , -0-(2'"deoxynucleosidyl)- 
30 methylhbenzoates (Figure 1; Compounds IVbi -♦) 

To 7 mmol of suitably protected 2 / -deoxynucleoside which had been dried by coevaporation from 
pyridine was added dropwise a solution containing 8 mmol of lllb dissolved in 25 ml pyridine. The reaction 
was stirred until complete (3-12 hrs) and quenched by the addition of 1 ml of methanol. The solvent was 
35 removed and the residue partitioned between 50 ml of ethyl acetate and 50 ml of 5% aqueous NaHCOa. 
The organic layer was washed with another portion of 5% aqueous NaHC0 3 and once with H2O prior to 
being dried (MgSCU), filtered and evaporated. The final product was obtained by crystallization from the 
described solvent. 

IVbi. 4.34 g (6.3 mmol, 90%) white crystal (benzene) 1 H-NMR (CDCI 3 ): 6 = 1.58 (d, 3H, CH 3 ); 2.20-2.47 
40 (m. 2H, H2\H2"); 2.64 (s, 1H, OH); 2.88 (s, 4H, CH2); 3.30-3.45 (m, 2H, H5',H5*); 3.79 (d, 6H. OCH); 
4.04-4.09 (m, 1H. H4'); 4.52 (m, 1H, H3'); 6.36 (dd, 1H, H1'); 6.83-6.87 (dd, 4H, Ar-H); 7.23-7.30 (dd, 4H, 
Ar-H); 7.44 (d, 1H, H6); 7.58-7.62 (d, 2H, Ar-H); 8.04-8.08 (d. 2H, Ar-H); 8.87 (s. 1H, N-H) 
IVIfc. 3.08 g (3.9 mmol, 55%) opaque solid (toluene) 1 H-NMR (CDCI 3 ): 5 = 2.51-2.63 (m, 1H, H2*); 2.85 
(s, 4H, CH2); 2.89-3.03 (m, 1H. H2*); 3.32-3.47 (m. 2H, H5',H5'); 3.78 (d, 6H, OCH 3 ); 4.17-4.23 (m, 1H, 
45 H4'); 4.69-4.76 (m, 1H, H3*); 6.44-6.50 (t, 1H, H1'); 6.78-6.86 (m f 4H, Ar-H); 7.12-7.29 (m, 5H, Ar-H); 7.48- 
7.60 (m, 4H, Ar-H); 7.94-8.06 (m, 4H, Ar-H); 8.14 (s. 1H. H8); 8.70 (s, 1H, H2); 9.15 (s, 1H, N-H) 
IVfe. 3.34 g (4.3 mmol, 62%) opaque solid (toluene) 'H-NMR (CDCI 3 ): 5 ■ 2.19-2.32 (m, 1H, HZ); 2.67- 
2.79 (m, 1H, H2*); 2.87 (s, 4H, CH 2 ); 3.42-3.52 (m. 2H, H5\ H5'), 3.81 (d, 6H, OCH 3 ); 4.15-4.19 (m, 1H, 
H40; 4.47-4.55 (m, 1H, H3'); 6.23-6.29 (t. 1H, HV); 6.86-6.92 (m, 4H. Ar-H); 7.19-7.32 (m, 5H, Ar-H); 7.50- 
50 7.62 (m. 5H, Ar-H); 7.86-7.90 (m, 2H. Ar-H); 8.04-8.09 (m, 2H, Ar-H); 8.18 (d. 1H, H6) 
IVb*. 4.16 g (5.3 mmol, 76%) white crystal (ethyl acetate) 

'H-NMR (DMSOds): 6 = 1.12 (d. 6H, CH 3 ); 2.30-2.42 (m, 1H, HZ)\ 2.70-2.84 (m, 2H, CH, H2*); 2.88 (s, 
4H, CH2); 3.10-3.27 (m, 2H. H5'. H5'); 3.72 (d, 6H, OCH 3 ); 3.96 (m, 1H, H4');4.42 (m, 1H, H3*); 5.36 (d, 
1H, OH); 6.27 (t, 1H, HV); 6.76- 6.87 (dd, 4H, Ar-H); 7.18-7.23 (dd, 4H f Ar-H); 7.60 <d. 2H, Ar-H); 7.97 (d, 
55 2H, Ar-H); 68.15 (s, 1 H, H8) 
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Example 6 

General procedure for the preparation of N-succinimidyh4*[bis-4>(methoxyphenyl)-5 , -0-(3 , -0-<N ( N- 
diisopropylamino^^yanoethylphosphinyl)-^ (Figure 1 ; Compounds 

5 VB,-0 

To 3 mmol of the appropriately protected monomer (IVbi-4) dissolved in 15 ml of tetrahydrofuran, was 
added 12 mmol of diisopropylethyiamine and 3.3 mmol of 2-cyanoethyldiisopropylaminochlorophosphine. 
The reaction was stirred at ambient temperature for 1-2 hrs, filtered and concentrated. The residue was 
10 dissolved in 100 ml of ethyl acetate, washed 3 times with 10% aqueous Na2C0 3 , dried (NcfeSO*) filtered 
and evaporated. The product was dissolved in 15 ml of ethyl acetate and dripped into 250 ml of hexanes. 
The precipitate was collected and dried. 

Vbi. 2.10 g (2.4 mmol. 79%) 31 P-NMR (CDCI 3 ): 6 = 145.550, 145.630 PPM 
Vb2. 2.53 g (2.5 mmol. 84%) 3, P-NMR (CDCI3): 6 = 145.494, 145.677 PPM 
rs Vbs. 1.93 g (2.0 mmol, 66%) 3, P-NMR (CDCI3): 5 = 145.604, 145.820 PPM 
VI*. 2.46 g (2.5 mmol, 84%) 3, P-NMR (CDCI3): 8 = 144.478, 145.519 PPM 

Example 7 

20 Oligonucleotide synthesis 

Oligonucleotides were assembled from commercially available 2-cyahoethylphosphoramidites either 
manually (Example 7a) or with an automated synthesiszer. A prototype large scale DNA synthesizer at 30 
umol scale was used in Example 7b whereas a Milligen/Biosearch Model 7500 DNA synthesizer running the 

25 standard 1 umol protocol was used in Example 9. In all cases, the final condensation was performed with 
the appropriate modified trityl protected 2-cyanoethylphosphoramidite (Vbi -4) to yield fully protected, 
support bound oligonucleotides having a 5'-terminal NHS ester linking group (L). The resins (or portions of 
the resins) were treated with amino group(s) containing compounds to functionalize the 5'-terminal NHS 
ester linking group (L) according to Example 8. Further extension or labeling could be performed on resin 

30 bound fully protected ^-functional group containing oligonucleotides according to the method described in 
Example 9 or the oligonucleotides could be removed from the support, partially deprotected. and reacted in 
solution according to Example 11. 

Example 7a 

35 

Synthesis of a thymidine dimer at 15 umol scale 

A commercially available 15 umol DMT-Thymidine-Succinyl-AP-CPG resin (Milligen/Biosearch, Division 
of Millipore, Burlington, Ma.) was detritylated with 3% dichloroacetic acid in dichloromethane until the eluent 

40 was colorless. The resin was washed with 5 ml of acetonitrile and dried under high vacuum. The resin was 
washed again with 5 ml of dry acetonitrile and then 950 ul of 0.085 M thymidine phosphoramidite (Vbi in 
Figure 1) in dry acetonitrile and 2 ml of commercially available tetrazole activating solution were mixed and 
slowly pushed through the resin over a period of four minutes. The resin was washed with 5 ml of 
acetonitrile and then 6 ml of commercially available oxidation solution was pushed through the column 

45 over a period of two minutes. Finally the resin was washed with 5 ml of acetonitrile and dried under high 
vacuum. Portions of the resin were treated with amino group(s) containing compounds to functionalize the 
5'-terminaJ NHS ester linking group (L) according to Example 8. The 5'-modified oligonucleotides were then 
partially deprotected and removed from the support according to Example 10. Table 1 summarizes the 
various compounds reacted with L and the yield of product as determined by HPLC analysis of the crude 

50 5'-modified thymidine dimer. 



55 
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Table 1 



Amino group(s) containing compounds used to modify the 5-terminal NHS ester linking group (L) of the 

thymidine dimer. 


Amine 


Retention Time* 


Area % 


cyclohexylamine 


40.37 


99 + 


1-aminohexane 


43.02 


99 + 


1 ,3-diaminopropane 


29.15 


78.5 


1 ,4-diaminobutane 


29.60 


73.3 


1 ,6-diaminohexane 


31 .24 


73.0 


1 ,8-diaminooctane 


33.88 


70.3 


1 ,1 2-diaminododecane 


40.78 


71.0 


3-amino-l ,2-propanediol 


29.85 


99 + 


2-amino-l ,3-propanediol 


27.81 


72.0 


6-aminohexanol 


34.43 


99 + 


6-aminocaproic acid 


30.83 


91.9 



"Reversed-phase HPLC analysis: Delta Pak C18-100A Liquid Chromatography Column (Waters 
Division of Millipore) 

Buffer A - 100 mM triethylammonium acetate pH 6.8, 
Buffer B = 95/5 acetonitrile/water 

Gradient: 0 min. (5% B); 50 min. (60% B); Flow rate = 1.0 ml/min; Temperature 40 *C 



Example 7b 

Synthesis of 5'-TCCCAGTCACGACGT -3* at 30 umol scale 

A 30 umol synthesis of 5'- TCCCAGTCACGACGT -3* was conducted using a prototype large scale DNA 
synthesizer running the protocol described in Table 2. The final condenstaion reaction was performed using 
compound Vbi to introduce the reactive NHS ester linking group (L) at the 5'terminus of the support bound 
oligonucleotide. 

At the end of the synthesis a portion of the resin was treated with 1 ,6-diaminohexane according to 
Example 8. The amine functional ized oligonucleotide was removed from the resin and partially deprotected 
as described in Example 10. Solution phase reaction of the 5'-terminal amino group with N-hydroxysuc- 
cinimidyl-biotin is described in Example 1 1 . 

The remainder of the resin was treated with various alkyl amines according to Example 8. The partially 
protected 5'-modified oligonucleotides were obtained by treating the resins according to Example 10 and 
were used to demonstrate multiplex purification in Example 1 3. 
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Table 2 



Large scale DNA synthesis protocol. 



Function 



w 



DCA wash 
ACN wash 
TEA wash 
ACN wash 
Condensation 
ACN wash 
b oxidation 
ACN wash 



Infuse Rate (ml/min) Duration 



7000 

7000 

7000 

7000 

N/A* 

7000 

7000 

7000 



(sec) 



240 

120 

60 

300 

300 

240 

30 

180 



75 



"For each condensation reaction a tenfold excess of 
2-cyanoethylphosphoramidite is dissolved in enough 2.5% 1 H-tetrazole in 
acetonitriie (w/v) to prepare a 200 mM solution. The activated amidite is 
infused and recirculated through the column bed for the period indicated. 



20 



25 



30 



35 



Example 6 

Aminolysis of the N-hydroxysuccinimidyl ester (Linker Group L) with alkylamines, alkyldiamines, hydroxyal- 
kyl amines or carboxyalkyl amines 

After the solid phase chemical assembly of an oligonucleotide, the support was placed in vacuo to 
remove any residual solvent. Typically, 2 ml of a 0.5-1 .0 M solution of amino group(s) containing compound 
in 75% aqueous dioxane was pushed through the resin over a period of two minutes. Compounds not 
soluble in 75% aqueous dioxane were dissolved as described in Table 3. The support was then washed 
with 2 ml of tetrahydrofuran and 2 ml of acetonitriie. The resin was subsequently dried in vacuo and was 
treated with ammonia as described in Example 10 or reacted further as demonstrated in Example 9. 
Reversed phase high performance liquid chromatographic (HPLC) analysis was used to determine the 
purity of all crude partially protected products. All analytical separations were performed with a 3.8 mm x 
150 mm Delta Pak C18-100A chromatography column. 

Table 3 



40 



45 



Variations on conditions used to derivatize NHS esters. 


Amine 


Solvent 


Concentration 


1 -ami nod odec arte 


THP 


1.0 M 


1 -aminopentadecane 


THF 


0.5 M 


1 -aminooctadecane 


THF 


0.4 M 


1 ,8-diaminooctane 


9/1 THF/water 


1.0 M 


1 , 1 2-diaminododecane 


95/5 dioxane/water 


0.5 M 


6-aminocaproic acid 


1/1 dioxane/5% aqu.NaHCOa 


0.5 M 



THF is an abbreviation for tetrahydrofuran 



50 



Example 9 

Support bound synthesis of biotin and fluorescein labeled PCR primers 

Two oligonucleotides complementary to opposite strands of a bacteriophage lambda DNA were 
chemically assembled according to Example 7. The sequences prepared were 5'-GATGAGTTCGTGTCCG- 
TACAACTGG-3' and 5'-GGTTATCGAAATCAGCCACAGCGCC-3'. These oligonucleotides would serve as 
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primers for the amplification of a 500 bp segment of lambda DNA using the polymerase chain reaction 
(PCR) (Example 15) and are referred to as PCR Primer 1 and PCR Primer 2, respectively. 

After completion of the 1 umol synthesis of each primer the 5'-terminal NHS ester of the resin bound 
protected oligonucleotides was reacted with 1,12-diarninododecane as described in Example 8. Both resins 

5 were then split into three unequal portions. 

One half of the resin from each synthesis was dried under high vacuum and treated with ammonia 
according to Example 10. Both crude samples were purified according to the method described in Example 
12 and a portion of each purified sample was detrityiated according to the method described in Example 
14. This produced both the completely deprotected and purified PCR primers (hereafter referred to as 

10 Control PCR Primer 1 and 2 respectively) required for Example 17. Each sample was shown to be pure by 
reversed-phase HPLC analysis 

One quarter of the remaining resin from each primer synthesis was exposed to one milliliter of a 0.25 M 
solution of N-hydroxysuccinimidyl-biotin dissolved in dimethylformamide/diisopropylethylamine/ water, 7/2/1 
(v/v/v) for a period of 30 minutes. Similarly, the remaining one quarter of the resin from each primer 

75 synthesis was exposed to one milliliter of a 0.125 M solution of di-0-pivaloyl-5-(N-succinimidyl)-fluorescein 
dissolved in dioxane/diisopropylethylamine/water, 7/2/1 (v/v/v) for a period of 60 minutes. Once the labeling 
reactions were complete, the support was washed with 2 ml of dimethylformamide, 2 ml of tetrahydrofuran. 
2 ml of acetonitrile and finally dried under high vacuum. Partially protected labeled oligonucleotides were 
obtained by treatment of the resins containing the fully protected and labeled oligonucleotides according to 

20 Example 10. 

Figure 2 is the HPLC analysis of the crude partially protected fluorescein labeled PCR Primer 1 . Both 

ultraviolet absorbance at 260 nm and fluorescence (inset) at 470 nm (410 nm exitation wavelength) were 

recorded. The analysis shows that the major component eluting at 34.4 minutes is the only fluorescent 

product. The crude sample was purified according to the method described in Example 12 and is herafter 
25 referred to as Fluor-PCR Primer 1 . HPLC analysis of the crude sample of fluorescein labeled PCR Primer 2 

(hereafter called Fluor-PCR Primer 2) was similar to that observed for Fluor-PCR Primer 1. It was similarly 

purified by reversed-phase HPLC. 

Reversed phase HPLC analysis of the crude biotin labeled Control PCR Primers (hereafter referred to 

as Bio-PCR Primer 1 and 2, respectively) was also very similar. The major component of each sample was 
30 isolated by the method described in Example 12. The presence of biotin in the oligonucleotides was 

confirmed by the binding of the purified products to a streptavidin agarose support (Coull. J. M. et aj, Tet. 

Lett. 27: 3991-3994 (1986)). Bio-PCR Primer 1 and Bio-PCR Primer 2 were used in polymerase chain 

reactions as described in Example 17. 
The HPLC conditions were: 
35 Buffer A = 100 mM triethylammonium acetate pH 6.8; 

Buffer B = 95:5 acetonitrile/water; Gradient: 0 min (5% B). 50 min (60% B); Flow rate = 1.0 ml/min; 

Temperature: 40 • C 

Example 10 

40 

Partial deprotection and removal of oligonucleotides from solid supports 

Once the oligonucleotide was functional ized/labe led as desired, the dried resin was treated with 0.5 ml 
of concentrated ammonia at 55 *C for 8-10 hrs. The resin was removed by filtration and washed with water. 
45 The filtrate and washings were combined and concentrated to dryness. The crude oligonucleotide was 
dissolved in 1 ml of deionized water. The yield and concentration of oligonucleotide were estimated from 
the absorbance at 260 nm of 10 ul of the sample diluted to 1.0 ml with water. 

Example 11 

so 

Solution phase labeling of oligonucleotides 

Labeled oligonucleotides were also be obtained by the solution phase reaction of functional group 
containing partially protected oligonucleotides. The oligonucleotide 5'-TCCCAGTCACGACGT -3* was pre- 
ss pared according to Example 7b and treated with 1 .6-diaminohexane to functionalize the NHS ester linking 
group L according to the method described in Example 8. After cleavage and partial deprotection of the 
oligonucleotide according to the method described in Example 10, the 5'-terminaJ amino group of the 
partially deprotected oligonucleotide was biotinylated as follows: 
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To 16 milligrams of N-hydroxysuccinimidyl-biotin dissolved in 500 ul of dimethylformamide was added 
250 ul of 0.1 M 4-(2-hydroxyethyl>-1-pipera2ine-ethanesulfonic acid (HEPES) pH 7.7 and 50 A260 units of 
crude 5'-amino group containing partially protected 15-mer in 250 ul of water. Aliquots of the reaction 
mixture taken at 5 minutes, 55 minutes and 120 minutes were analyzed by reversed-phase HPLC. The 
5 analysis indicated complete disappearance of the major component of the crude mixture in favor of a more 
hydrophobic product within two hours. No other components of the mixture were observed to react within 
two hours. This new product was isolated by preparative scale reversed-phase HPLC according to the 
method described in Example 12 and was shown to bind to streptavtdin agarose (Couli, J. M. et aJ. Ibid ). 

The HPLC elution conditions were: 
io Buffer A = 100 mM triethylammonium acetate pH 6.8; 

Buffer B = 95:5 acetonitrile/water; Gradient: 0 min (5% B), 40 min (40% B); Flow rate = 1.0 ml/min; 
Temperature: 40 • C 

Example 12 

Preparative purification of partially protected oligonucleotides 

Oligonucleotides were purified by preparative scale HPLC. Separations were achieved using a 7.8mm x 
300 mm Delta Pak C18-300A Liquid Chromatography Column (Waters, Milford MA). The column eluent was 

20 collected in approximately one milliliter fractions in tubes containing 50 ul of diisopropylethylamine and 50 
Ul of unbuffered 20 mM tris(hydroxymethyl)aminomethane (Tris). Product fractions were concentrated to 
dryness under vacuum, the residue was dissolved in sterile water and the contents of the tubes combined. 
The material was again concentrated to dryness, and the residue dissolved in sterile water. Aliquots of 
samples to be used in polymerase chain reactions were adjusted to a concentration of 20 nmol/mL. Elution 

25 conditions for preparative HPLC separations were optimized for individual isolations. 

Example 13 

Multiplex purification of partially protected oligonucleotides 

30 

Multiplex purification was defined as the simultaneous purification of two or more compounds. Since it 
is possible to manipulate the 5'-terminus of a fully protected oligonucleotide as described in Example 8, it is 
also possible to predetermine the affinity of the resulting partially protected oligonucleotide for an affinity 
support. Attachment of alkyl chains to the triphenylmethyl derivative results in selective alterations to the 

35 affinity of the partially protected oligonucleotides for hydrophobic supports. The common support used in 
preparative scale reversed-phase high performance liquid chromatography (HPLC) is C18 coated silica for 
which oligonucleotides have little affinity. 

The effect of alkyl chain length on the retention of partially protected oligonucleotides is demonstrated. 
In this example, portions of the resin containing the fully protected sequence 5'-TCCCAGTCACGACGT -3* - 

40 (from Example 7b) were treated by the method described in Example 8 with seven different alkylamines of 
general formula H2N(CH2)„CH3 where n = 2, 5, 7, 9, 11, 14, or 17. The resin aliquots were mined and 
treated with ammonia as described in Example 10. Figure 3 is the reversed-phase HPLC analysis of the 
components of the crude mixture. Each of the seven partially protected oligonucleotides was baseline 
separated from the others by several minutes and the products eluted in order of increasing alkyl chain 

45 length. The identity of the peaks was confirmed by coelution with independently isolated authentic samples. 
The HPLC conditions used were: 
Buffer A = 100 mM triethylammonium acetate pH 6.8; Buffer B = 95:5 acetonitrile/water; Gradient: 0 min 
(10% B), 5 min (20% B), 50 min (55% B), 60 min (60% B), 65 min (60% B); Row rate = 1.0 ml/min; 
Temperature: 40 • C 

so In another example of multiplex purification, two partially protected oligonucleotides of differing length 
and sequence were simultaneously purified. This was performed by derivatization of the NHS ester linking 
group (L) of two fully protected oligonucleotides with alkyl chains of differing length according to the 
procedure described in Example 8. A 30-mer of nucleotide sequence 5'-AATTCATAAGGTAATTCAAAATGT- 
TTGTCA-3' was treated with 1-aminodecane and a 24-mer of nucleotide sequence 5'- ACTCCCGGCCCCC- 

55 GGGCCTCCACC -3' was treated with 1-aminohexane. Analytical scale HPLC of the crude partially protected 
products obtained according to Example 10 was performed. The major component of the 1-aminodecane 
derivatized 30-mer eluted at 33.8 minutes and comprised 60.6% of the sample as determined by integration 
of the peak areas. The major component of the 1-aminohexane derivatized 24-mer eluted at 24.0 minutes 



18 



EP 0 424 819 B1 



and comprised 58.8% of the sample as determined by integration of the peak areas. These two major 
products having significantly different retention times were baseline resolved in a simultaneous preparative 
scale purification according to Example 12. Removal of the 5'-terminai modified protecting group according 
to Example 14 gave the purified fully deprotected oligonucleotides whose identity was confirmed by 
5 coelution with independently isolated authentic samples. 
The analytical HPLC elution conditions used were: 
Buffer A = 100 mM triethylammonium acetate pH 6.8; 

Buffer B = 95:5 acetonitrile/water; Gradient: 0 min (1 0% B), 5 min (20% B), 50 min (55% B), 60 min (60% 
B), 65min (60% B); Flow rate = 1 .0 ml/min; Temperature: 40 • C 
10 The preparative HPLC elution conditions were: 
Buffer A = 100 mM triethylammonium acetate pH 6.8; 

Buffer B = 95:5 acetonitrile/water; Gradient: 0 min (10% B), 5 min (20% B), 50 min (55% B); Flow rate = 
3.0 ml/min; Temperature: 60 • C 

75 Example 14 

Complete deprotection of partially protected oligonucleotides 

When the complete deprotection of modified trrtyl containing oligonucleotides was desired, the samples 
20 were evaporated to dryness and dissolved in 100 ul of 60% aqueous acetic acid. After two hours at 0* C 

the samples were evaporated to dryness and dissolved in a known volume of water. 

This procedure was carried out on one half of each of the amino group containing Control PCR primer 

sequences described in Example 9. This yielded fully deprotected unmodified (natural) primers for the 

polymerase chain reactions described in Example 17. 
25 Complete removal of the 5'-terminal modified trityl group by the aqueous acid was assayed by reversed 

phase HPLC. Analysis of the purified partially protected 1.12-diaminododecane modified PCR Primer 1 

described in Example 9 indicated a single compound which eluted at 30.7 minutes. Following acid 

treatment no starting material remained and a single peak corresponding to the fully deprotected 

oligonucleotide was observed to elute at 1 5.5 minutes. 
30 The HPLC analysis conditions were: 

Buffer A = 100 mM triethylammonium acetate pH 6.8; 

Buffer B = 95:5 acetonitrile/water; Gradient 0 min (5% B), 50 min (60% B); Row rate = 1.0 ml/min; 
Temperature: 40 • C 

35 Example 15 

Conditions used For PCR reactions 

Polymerase chain reactions (PCR) were comprised of 200 ul of 50 mM KCI, 10 mM buffer (Tris pH 8.3 
40 or N-2-hydroxyethyl piperazine-N'-2-hydroxy propane sulfonic acid (Heppso) pH 8.7 as indicated), 1.5 mM 
MgCb and 200 mM each dNTP containing 10 units of AmpliTaq™ DNA polymerase, 200 pmol each 
primer, and 2 to 20 fmol bacteriophage lambda DNA template. Two different amplification cycle protocols 
for 15 repetitions each were used to complete a total of 30 amplification cycles per sample. The cycle for 
the first 15 repetitions was 15 seconds at 96* C, 15 sec at 65* C and 30 seconds at 72* C + 2 
45 seconds/cycle extension. The cycle for the second 15 repetitions was 15 seconds at 96* C, 15 sec at 55* 
. C and 60 seconds at 72 • C + 2 seconds/cycle extension. 

Example 16 

so Preparation of streptavidin agarose 

Streptavidin agarose (Gibco BRL, Bethesda MD) was treated before use. The agarose was washed 
three times with 200 ul of 150 mM NaCI, 10 mM NaH 2 PO* pH 7.2 containing 0.05% NaN 3 and 10% 
acetonitrile. A 25 ul aliquot of 0.5 M NaCI in 80% aqueous acetic acid containing 0.5% (w/v) 
55 aminoethanethiol hydrochloride was then washed through the resin and then 100 ul of this solution was 
allowed to react with the streptavidin agarose for 1 nr. The resin was again washed three times with 100 ul 
of 150 mM NaCI, 10 mM NaH 2 PO* pH 7.2 containing 0.05% NaN 3 and 10% acetonitrile. 
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Example 17 



TO 



Affinity purification and isolation of PCR amplified nucleic acids 

The use of 5'-modified oligonucleotides for the purification of polymerase chain reaction products was 
demonstrated. A 500 base pair segment of bacteriophage lambda DNA was amplified using various 
combinations of biotinylated or natural (unmodified) primers. Amplification products from reactions contain- 
ing the biotinylated primers were demonstrated to be selectively retained by a streptavidin agarose support. 
Exposure of the support to conditions known to cleave the 5'-biotinylated trityl group from the DNA allowed 
recovery of the 500 base pair fragment in its unmodified form. 

PCR reactions were carried out as described in Example 15 using the Control PCR Primers 1 and 2 
and Bio-PCR Primers 1 and 2 from Example 9. Table 4 describes the various combinations of primers and 
conditions used in the individual reactions. 



75 



Table 4 



20 



25 



30 



35 



Primers and conditions used for PCR reactions. 


Reaction 


Primers 


Buffer 


Polymerase 


1 


Control PCR Primer 1 
Control PCR Primer 2 


Tris 




2 


Control PCR Primer 1 
Control PCR Primer 2 


Heppso 




3 


Control PCR Primer 1 
Control PCR Primer 2 


Tris 


+ 


4 


Control PCR Primer 1 
Control PCR Primer 2 

- 


Heppso 


+ 


5 


Bio-PCR Primer 1 
Control PCR Primer 2 


Heppso 


+ 


6 


Control PCR Primer 1 
Bio-PCR Primer 2 


Heppso 


+ 


7 


Bio-PCR Primer 1 
Bio-PCR Primer 2 


Heppso 


+ 



Following amplication, a portion of each PCR reaction (50 ul) was incubated with pretreated (see 

40 Example 16) streptavidin agarose (100 ul) for 30 minutes. At the end of the incubation, the supernatant was 
removed and the streptavidin agarose was washed twice with 100 ul of 150 mM NaCI. 10 mM NahfePCU pH 
7.2 containing 0.05% NaNa and 10% acetonitrile (hereafter called streptavidin buffer).The washings were 
combined with the supernatant. These samples are the "support eluent" and contain any material which 
does not bind to the streptavidin support. The streptavidin agarose was again washed twice with 100 ul 

45 streptavidin buffer and these aliquots were discarded. 

Material that had bound to the streptavidin agarose by the biotin trityl linker was then recovered by 
exposure of the support to acid. A 25 ul aliquot of 0.5 M NaCI in 60% aqueous acetic acid was washed 
through the resin bed and saved. The resin was then treated for 2 hours with 100 ul of 0.5 M NaCI in 80% 
aqueous acetic acid at 4*C. The supernatant was recovered and combined with the initial acid wash. The 

50 support was then washed three times with 100 ul of a solution containing 0.25 mM Tris pH 9.0, 0.15 mM 
NaCI and 0.05% sodium azide (w/v) in water/acetonitrile, 8/2 (v/v). The acid fractions and washings were 
combined and hereafter are referred to as the "acid eluent" samples. These samples contain material which 
can be removed from the support by conditions known to cleave the trityl ether bond. 

To demonstrate that selective adsorption and recovery of biotinylated PCR generated DNA fragments 

55 was achieved, the PCR reactions, "support eluent" samples and "acid eluent" samples were analyzed by 
gel electrophoresis. Prior to the analysis "support eluent" samples and "acid eluent" samples were 
concentrated to dryness and dissolved in 50 ul of sterile water. Salt and excess primer were removed by 
applying each sample to a Sephadex G-50 spin column in the manner described by the manufacturer 
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(Bosh ringer Mannheim, Indianapolis, IN). These samples were again evaporated to dryness and dissolved in 
50 ul of sterile water. Aliquots of 10 ul were then applied to a 2% agarose electrophoretic gel and 
compared directly with PCR reaction products. 

Rgure 4 is a photograph of the ethidium bromide stained electrophoretic gel obtained from the analysis 
5 of the various samples. 

Lanes A and B contain aliquots of control PCR reactions 1 and 2 in which all the components of the 
reactions, as described in Example 15, were added except the polymerase. As expected, no amplification is 
seen. Lanes C and T contain a commercially available DNA size marker (123 bp ONA marker, Gibco BRL, 
Bethesda, MD) and Lane I contains another commercially available DNA size marker (DNA Marker V t 
w Boehringer Mannheim, Indianapolis, IN). 

Lanes D-H contain the PCR products from reactions 3-7 of Table 4, respectively. In each case a well 
defined band corresponding to the 500 base pair DNA fragment is present demonstrating that efficient 
amplification was achieved with the various combinations of buffers and unmodified or biotinylated primers. 

Analysis of material in the "support eluent" is shown in lanes J-N which correspond to reactions 3-7, 
is respectively. The presence of the bands in lanes J and K demonstrates that amplification products from 
reactions that contain unmodified primers are not retained by the streptavidin agarose. The absence or 
marked decrease of the fragment in lanes L-N (from reactions 5-7) shows that DNA derived from the 
biotinylated trityl primers is bound by the support. 

Recovery of bound biotinylated fragments was achieved by acid treatment of support bound DNA as 
20 shown in lanes O-S (from reactions 3-7). Lanes O and P correspond to the acid eluent derived from 
reactions 3 and 4, respectively. No DNA is present because these reactions contained unmodified primers 
and thus no DNA was bound to the streptavidin agarose. Lanes OS contain the acid eluent from reactions 
5-7, respectively. The presence of the 500 base pair fragment in lanes OS demonstrates that 5'-modified 
support bound DNA can be recovered upon exposure of the support to conditions known to cleave the 
25 modified trityl group. 

Equivalents 

Those skilled in the art will recognize or be able to ascertain, using no more than routine experimenta- 
30 tion, many equivalents to the specific embodiments of the invention described herein. These and all other 
equivalents are intended to be encompassed by the following claims. 

Claims 

35 1. A compound represented by the formula: 
L-P-CT 

wherein C* is a natural product, biopolymer or synthon for a biopolymer or natural product; 
40 P is a protecting group bound to a functional group of C" which can be removed from the protected 
functional group under conditions such that the functional group is regenerated; and 
L is a functionality for bonding a modifying moiety to P, provided that L is not a phenol (hydroxy!) 
group or an arylamino group. 

45 2. The compound of Claim 1 , wherein C* is selected from the group consisting of a nucleoside, nucleotide, 
oligonucleotide, nucleic acid, amino acid, peptide, protein, monosaccharide, oligosaccharide, carbohy- 
drate, lipid, steroid and alkaloid. 

a The compound of Claim 1 1 wherein P is a protecting group selected from the group consisting of trityl, 
so 4-monomethoxytrityl, 4,4'-dimethoxytrityl, 9-phenylxanthene-9-yl, 9-fluorenylmethyloxy-carbonyl, t- 
butoxycarbonyl, benzyl, benzyloxycarbonyl, alkoxybenzyl. benzyloxymethyl, alkoxymethyl, alkylsilyl, 
arylsilyl, benzoyl, phenoxyacetyl and alkoxyacetyl; wherein P can be optionally substituted. 

4. A compound represented by the formula: 

55 

M-L-P-C* 

wherein C* is a natural product, biopolymer or synthon for a biopolymer or natural product; 
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P is a protecting group bound to a functional group of C\ which can be removed from the protected 
functional group under conditions such that the functional group is regenerated; 
M is a modifying moiety selected from the group consisting of a detection label, a biologically active 
molecule and a compound for aiding in the purification or immobilisation of C*, provided that M is not 9- 
s fluorenylmethoxycarbonyl (Fmoc). levulinoyl or 4,5-dichlorophthalimido and 

L is a functionality for bonding M to P, provided that L is not a phenol (hydroxyl) group or an aryhamino 
group. 

5. The compound of Claim 4, wherein C* is selected from the group consisting of a nucleoside, nucleotide, 
w oligonucleotide, nucleic acid, amino acid, peptide, protein, saccharide, oligosaccharide, carbohydrate, 

lipid, steroid and alkaloid. 

6. The compound of Claim 4, wherein M is selected from the group consisting of a radioisotope, 
fluorophore, luminescent compound, chemi luminescent compound, btotin, peptide, protein, nucleoside, 

is nucleotide, oligonucleotide, nucleic acid, saccharide, oligosaccharide, carbohydrate and polymer. 

7. The compound of Claim 4, wherein P is a protecting group selected from the group consisting of trityl, 
4-monomethoxytrityl, 4,4 , -dimethoxytrityl, 9-phenylxanthene-9-yl. 9-fluorenylmethyloxy-carbohyl. t- 
butoxycarbonyl, benzyl, benzyloxycarbonyl, benzyloxy methyl, alkoxycarbonyl, alkoxy methyl, alkylsilyl, 

20 arylsilyl, benzoyl, phenoxy acetyl and alkoxy acetyl; wherein P can be optionally substituted. 

8. A compound represented by the formula: 




wherein A'i - A'is are the same or different and are selected from the group consisting of 
40 H, R, OR, Z and L, provided that there is at least one L group; 

R is an alkyl group having one to 20 carbon atoms which may optionally contain a heteroatom; or a 
substituted or unsubstituted aryl; 

Z is -(CH2) n C(0)OH, -(CH2)„S0 3 H, -(CH^NCfe. -(CH2) n CN, -(CH^OH, -(CH 2 ) n NH 2 and -<CH2)„SH, 
where 

45 n is an integer from zero to 20, provided that when n is zero, then Z is a group other than OH or -NH 2 ; 
L is -(CH2)„C(0)W, -(CH 2 ) n S02VV, -(ChkJnW, where n is an integer from zero to 20 and 
X is a leaving group selected from CI, Br, I and BF* and 
W is selected from the group of CI, Br, I, -NCS, -NCO. 

so 
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9. A compound represented by the formula: 



5 



10 




15 

wherein A*i - A'is are the same or different and are selected from the group consisting of H, R, OR, Z 
and L, provided that there is at least one L group; 

R is an alky I group having one to 20 carbon atoms which may optionally contain a hetero-atom; or a 
20 substituted or unsubstituted aryt; 

Z is -(CH 2 )„C(0)OH, -<CH 2 )„S0 3 H, -(CH2)„N02. -(CH 2 )„CN, -(CH 2 )„OH, -(CH 2 )„NH2 and -(CH 2 )„SH, 
where n is an integer from zero to 20, 

L is -(CH 2 )„C(0)W, -(ChfeJnS^ W, -(CH 2 ) n W, where n is an integer from zero to 20; and 
W is selected from the group consisting of CI, Br, I, -NCS, - NCO, 

25 



30 




and D1-D5 are the same or different and from H, F, CI, Br, I, NO2 and CN. 

35 

10. The compound of Claim 9, wherein C* is a nucleoside represented by the formula: 



40 




K G 



45 

wherein B is a nucleoside base which may be protected by a base protective group which can 

eliminated; 

G is H, OH, OR. 

50 * 

55 

K is H. OH. 
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R 



and R is an alky I group having one to 20 carbon atoms which may optionally contain a heteroatom, or a 
substituted or unsubstrtuted aryl. 

11. The method of reversibly modifying a natural product, biopolymer or synthon for a natural product or 
biopolymer, comprising the steps of 

a) coupling a natural product, biopolymer or synthon to a protecting group represented by the 
formula: 



wherein P is a protecting group for protection of a functional group of the natural product, 
biopolymer or synthon for a natural product or biopolymer which can be removed from the protected 
functional group under conditions such that the functional group is regenerated; L is a functionality 
for bonding a modifying moiety to P and 
b) coupling a modifying moiety to L. 

12. A method for reversibly modifying a polynucleotide, comprising the steps of: 

a) coupling a modified protected nucleoside beta-cyanoethyl phosphoramidite to the S'-terminal 
hydroxyl group of the polynucleotide, wherein the nucleoside phosphoramidite is represented by the 
structure: 



wherein P is a nucleoside protecting group for protection of the 5'-hydroxyl group, capable of being 
removed under conditions such that the 5' -hydroxy I group is regenerated; 
L is a functionality for bonding a modifying group; 

B is a nucleoside base which may be protected by a protective group which can be eliminated; 
G is H, OH, OR, 



L-P 




K G 




K is H, OH. 
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and; 

R is an alky I group having one to 20 carbon atoms which may optionally contain a hetero atom, or a 
substituted or unsubstituted aryl, 
b) coupling a modifying moiety to L 

Pate ntansprtl che 

1. Verbindung der Formel: 
L-P-C* 

worin bedeuten: 

C* ein natUrliches Produkt, Biopolymer Oder Synthon fUr ein Biopolymer Oder natUrliches Produkt, 
P eine an eine funktionelle Gruppe von C* gebundene Schutzgruppe, die von der geschOtzten 
funktionellen Gruppe unter derartigen Bedingungen, dafi die funktionelle Gruppe regeneriert wird, 
entfemt werden kann, und 

L eine FunktionalitSt zur Bindung einer modifizierenden Einheit an P. vorausgesetzt, daB L nicht fUr eine 
Phenol (hydroxyl) gruppe oder eine Arylaminogruppe steht. 

2. Verbindung nach Anspruch 1, wobei C* aus der Gruppe Nucleoside, Nucleotide, Oligonucleotide, 
NucleinsSuren, AminosMuren, Peptide, Proteine, Monosaccharide, Oligosaccharide, Kohlenhydrate, Lipi- 
de, Sterotde und Alkaloide ausgewShlt ist. 

3. Verbindung nach Anspruch 1, wobei P fUr eine Schutzgruppe steht, die aus der Gruppe Trityl, 4- 
Monomethoxytrityl, 4,4 , -Dimethoxytrityl, 9-Phenylxanthen-9-yl, 9-Fluorenylmethyloxycarbonyl, tert.-Bu- 
toxycarbonyl, Benzyl, Benzyloxycarbonyl, Alkoxybenzyl, Benzyloxy methyl, Alkoxymethyl, Alkylsilyl. 
Arylsilyl, Benzoyl, Phenoxyacetyl und Alkoxyacetyl ausgewa"hlt ist, wobei P gegebenenfalls substituiert 
sein kann. 

4. Verbindung der Formel: 
M-L-P-C* 

worin bedeuten: 

C* ein natUrliches Produkt, Biopolymer oder Synthon fUr ein Biopolymer oder natUrliches Produkt, 
P eine an eine funktionelle Gruppe von C* gebundene Schutzgruppe, die von der geschUtzten 
funktionellen Gruppe unter derartigen Bedingungen, dafi die funktionelle Gruppe regeneriert wird, 
entfemt werden kann, 

M eine modifizierende Einheit, die aus der Gruppe Nachweismarkierungen, biologisch aktive MolekUle 
und Verbindungen zur UnterstUtzung einer Reinigung oder Immobilisierung von C* ausgewahlt ist, 
vorausgesetzt, dafi M nicht fUr 9-Ruorenylmethoxycarbonyl (Fmoc), Levulinoyl oder 4,5-DichlorphthaIi- 
mido steht. und 

L eine Funktionalitat zur Bindung von ManP, vorausgesetzt, dafi L nicht fOr eine Phenol(hydroxyl)- 
gruppe Oder eine Arylaminogruppe steht. 
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5. Verbindung nach Anspruch 4, wobei C* aus der Gruppe Nucleoside, Nucleotide, Oligonucleotide, 
Nucleinsduren, Aminosauren, Peptide, Proteine, Saccharide, Oligosaccharide. Kohlenhydrate, Lipide, 
Steroide und AlkaJoide ausgewahlt ist 

5 6. Verbindung nach Anspruch 4, wobei M aus der Gruppe Radioisotope, Fluorophore, lumtneszierende 
Verbindungen, chemilumineszierende Verbindungen, Biotine, Peptide, Proteine, Nucleoside, Nucleotide, 
Oligonucleotide, Nucleinsduren, Saccharide, Oligosaccharide, Kohlenhydrate und Polymere ausgewahlt 
ist. 

10 7. Verbindung nach Anspruch 4, wobei P fUr eine Schutzgruppe stent, die aus der Gruppe Trityl, 4- 
Monomethoxytrityl, 4,4 , -Dimethoxytrityl, 9-Phenylxanthen-9-yl, 9-Fluorenylmethyloxycarbonyl, tert.-Bu- 
toxycarbonyl, Benzyl, Benzyloxycarbonyl, Benzyloxymethyl, Aikoxycarbonyl, Alkoxymethyl, Alkylsilyl, 
Arylsilyl, Benzoyl, Phenoxy acetyl und Alkoxy acetyl ausgewahlt ist. wobei P gegebenenfalls substituiert 
sein kann. 

15 

a Verbindung der Formel: 



20 



25 



30 




worm A'i bis A'is gleich oder verschieden sind und aus der Gruppe H, R, OR, Z und L ausgewahlt 
sind, vorausgesetzt, daB mindestens eine Gruppe L vorhanden ist, 
35 R fUr eine Alkylgruppe mit 1 bis 20 Kohlenstoffatom(en), die gegebenenfalls ein Heteroatom enthalten 
kann, Oder ein substituiertes oder nicht substituiertes Aryl steht, 

Z -(CH2) n C(0)OH, -(ChfcJnSOaH, -(ChfeJnNO^ -(CHOnCN, -(CH 2 ) n OH. -(CH 2 ) n NH 2 und-(CH 2 ) n SH bedeu- 
tet, wobei 

n eine ganze Zahl von 0 bis 20 bedeutet, vorausgesetzt, daB bei n = 0 Z fOr eine von OH oder -NH 2 
40 verschiedene Gruppe steht, 

L -(CKfeJnCfOJW, -(C^JnSCbW, -(CH 2 ) n W darstellt. wobei n fUr eine ganze Zahl von 0 bis 20 steht. und 
X eine aus CI, Br. I und BF 4 ausgewahlte Abgangsgruppe bedeutet und 
W aus der Gruppe CI, Br, I, -NCS, -NCO ausgewShlt ist. 



45 
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9. Verbindung der Formel: 




worin A'i bis A' 15 gleich Oder verschteden sind und aus der Gruppe H, R, OR, Z und L ausgewShlt 
sind, vorausgesetzt, dafi mindestens eine Gruppe L vorhanden ist, 

R eine AJkytgruppe mit 1 bis 20 Kohlenstoffatom(en), die gegebenenfails ein Heteroatom enthalten 
kann, oder ein substituiertes Oder nicht substituiertes Aryl bedeutet, 

Z -(CH2)„C(0)OH, -(CH2) n S0 3 H t -(CH^NCfe, -(CH^nCN, -(CH^OH, -(CH 2 )„NH2 und -(CH 2 ) n SH dar- 

stellt, wobei n fOr eine ganze Zahl von 0 bis 20 stent, 

L -(CH 2 )„C(0)W, -<CH2)„S02W, -(CH 2 )„W entspricht, wobei 

n ftlr eine ganze Zahl von 0 bis 20 stent, und 

W aus der Gruppe CI, Br, I, -NCS, -NCO. 




ausgewShlt ist und 

D1-D5 gleich oder verschieden sind und aus H, F, CI, Br, I, NO2 und CN (ausgewShlt sind). 
10. Verbindung nach Anspruch 9, wobei C* ein Nucleosid der Formel: 



K C 



worin B fOr eine Nucleosidbase steht, die durch eine eliminterbare Basenschutzgruppe geschUtzt sein 
kann. G H, OH, OR, 



-V- 
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bedeutet, 
K H, OH. 




entspricht und 

R fUr eine Alkylgruppe mit 1 bis 20 Kohlenstoffatom(en), die gegebenenfalls ein Heteroatom enthaJten 
kann, oder ein substituiertes oder ntcht substituiertes Aryl steht, bedeutet 

11. Verfahren zur reversibten Modifizierung eines natUrlichen Produkts, Biopolymers Oder Synthons fUr ein 
natUrliches Produkt oder Biopolymer, umfassend die folgenden Schritte: 

a) Kuppeln eines natUrlichen Produkts, Biopolymers Oder Synthons an eine Schutzgruppe der 
Formel: 



L-P 



worm P fUr eine Schutzgruppe zum Schutz einer funktionellen Gruppe des natUrlichen Produkts. 
Biopolymers oder Synthons fUr ein natUrliches Produkt oder Biopolymer steht, die von der geschUtz- 
ten funktionellen Gruppe unter derartigen Bedingungen, dafi die funktionelle Gruppe regeneriert wird, 
entfernt werden kann, L eine FunktionalitSt zur Bindung einer modiftzierenden Einheit an P bedeutet, 
und 

b) Kuppeln einer modiftzierenden Einheit an L. 

12. Verfahren zur reversiblen Modifizierung eines Poly nucleoids, umfassend die folgenden Schritte: 

a) Kuppeln eines modiMzierten geschUtzten Nucleosid-beta-cyanoethylphosphoramidits an eine 5'- 
terminale Hydroxy Igruppe des Polynucleotids, wobei das Nucleosidphosphoramidit der folgenden 
Struktur entspricht: 



K G 



worin P fUr eine Nucleosidschutzgruppe zum Schutz der 5'-Hydroxylgruppe steht, die unter derarti- 
gen Bedingungen, dafi die 5'- Hydroxy Igruppe regeneriert wird, entfernt werden kann, 
L eine FunktionaJitat zur Bindung einer modifizierenden Gruppe bedeutet, 

B eine Nucleosidbase darstelft, die durch eine eliminierbare Schutzgruppe geschUtzt werden kann, 
G H, OH, OR, 
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bedeutet, 
K H, OH, 




entspricht und 

R fOr etne AJkylgruppe mit 1 bis 20 Kohlenstoffatom(en), die gegebenenfalls ein Heteroatom 
enthalten kann, oder ein substituiertes Oder nicht substituiertes Aryl steht, und 
b) Kuppeln einer modifizierenden Einheit an L. 

Revendlcations 

1. Un compost represents par la formute : 
L-P-C* 

dans laquelle C* est un produit naturel, un biopolymere ou un synthon pour un biopolymere ou un 
produit naturel ; 

P est un groupe de protection lie* a un groupe fonctionnel de C" qui peut §tre supprime' du groupe 
fonctionnel protege dans des conditions telles que le groupe fonctionnel est r6g6n6t6 ; 
et L est une fonctionnalite pour lier une motfe* de modification a P, avec la condition que L n'est pas un 
groupe phenol (hydroxyle)) ou un groupe aryle-amino. 

2. Le compost de la revendication 1, dans lequel C* est seiectionne dans le groupe constitue par un 
nucleoside, un nucleotide, un oligonucleotide, I'acide nucieique, un acide amino, un peptide, une 
proline, le monosaccharide, I'oligosaccharide, un carbohydrate, une lipide, un steVotte et un alcaloTde. 

3. Le compost de ia revendication 1, dans lequel P est un groupe de protection seiectionne dans le 
groupe constitue par le trityle, le 4~monom£thoxytrityle, le 4,4' -dim£thoxytrityle, le 9-phenylxanthene-9- 
yle, le 9-fluor€nylm6thyloxy-carbonyle, le t-butoxycarbonyJe, le benzyle, le benzyloxycarbony- 
le.ralkoxybenzyle, le benzyloxymethyle, I'alkoxymethyle, TaJkylsilyle, I'arylsilyle, le benzoyle, le ph6- 
noxyacetyle et I'aJkoxyacetyle ;P pouvant etre eventuellement substitue. 

4. Un compose" represents par la formule : 
M-L-P-C* 

dans laquelle C* est un produit naturel, un biopolymere ou un synthon pour un biopolymere ou un 
produit naturel ; 
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P est un groups de protection lie k un groupe fonctionnel de C*. qui peut §tre supprime du groups 
fonctionnel protege dans des conditions telles que Id groupe fonctionnel est reggngrg; 
M est une moitie de modification se lection r\6e dans le groupe constitue par une Etiquette de detection, 
une molecule biologiquement active et un compose* pour aider a la purification ou rim mobilisation de 
C*. avec la condition que M n'est pas du 9-fluorenylmethoxycarbonyle (Fmoc), du leVufinoyle ou du 4,5- 
dichlorophthalimido et 

L est une fonctionnalrte' pour lier M a P, avec la condition que L n'est pas un groupe phenol (hydroxyle) 
ou un groupe aryle-amino. 

5. Le compose* de la revendication 4, dans lequel C* est seiectionne dans le groupe constitue par un 
nucleoside, un nucleotide, un oligonucleotide, I'acide nucieique, un acide amino, un peptide, une 
proline, le saccharide, I'oligosaccharide, un carbohydrate, une lipide, un steroTtde et un alcaloTde. 

6. Le compose de la revendication 4, dans lequel M est seiectionne dans le groupe constitue par un 
radioisotope, un fluorogene, un compose luminescent, un compose chimioluminescent, une biotine, un 
peptide, une proteine, un nucleoside, un nucleotide, un oligonucleotide, I'acide nucieique, le saccharide, 
('oligosaccharide, un carbohydrate et un polymere. 

7. Le compose de la revendication 4, dans lequel P est un groupe de protection seiectionne dans le 
groupe constitue par le trityle, le 4-monomethoxytrityle, le 4,4* -dimethoxytrityle, le 9-phenylxanthene-9- 
yle, le 9-fluorenylmethyloxycarbonyle, le t-butoxycarbonyle, le benzyle, le benzyloxycarbonyle, le 
benzyloxymethyle, I'alkoxycarbonyle, Talkoxymethyle, I'alkylsilyle, I'arylsilyle, le benzoyls, le phenoxya- 
cetyle et I'alkoxyacetyle ; P pouvant etre eventuellement substitue. 

8. Un compose represente par la formule : 




dans laquelle AS - A'i 5 sont les mimes ou sont difterents et sont seiectionnes dans le groupe 
constitue par 

H.R.OR.Z et L, avec la condition qu'il existe au moins un groupe L ; 

R est un groupe alkyle comportant 1 a 20 atomes de carbone qui peuvent eventuellement contenir un 
heteroatome ; ou un aryle substitue ou non substitue ; 

Z est -<CH2) n C(0)OH. -(CHs^SCfeH, -(CI^NCfe, -(CKfcJnCN. -(CH2) n OH, -(CHaJnNHa et -(CH 2 ) n SH. 

n etant un nombre entier de 0 a 20, avec la condition que, lorsque n vaut zero, alors Z est un groupe 

autre que OH ou -NH2 ; 

L est -(CH2)„C(0)W, -(CHjJnSCfcW. -(CH 2 )„W, n etant un nombre entier de 0 a 20, 

X est un groupe restant seiectionne parmi CI, Br, I et BF* , et 

W est seiectionne dans le groupe constitue par CI, Br, I, *NCS, -NCO. 
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9. Un compose" represents par la formule : 




dans laquelle A'i - A'is sont les m§mes ou sont differents et sont s€lectionn€s dans le groupe 
constitu£ par H.R.OR.Z et L, avec la condition qu'il existe au moins un groupe L ; 
R est un groupe alkyle comportant 1 a 20 atomes de carbone qui peuvent £ventuellement contenir un 
h6teVoatome ; ou un aryle substitue" ou non substitue* ; 

Z est -(CH 2 ) ft C(0)OH, -(CH 2 ) 0 S0 3 H. -<CH 2 ) n N02, -(CH 2 ) n CN, -(CH 2 ) n OH, -(CH 2 ) n NH2 et -(CH^SH. n 
£tant un nombre entier de 0 a 20. 

L est -(CH 2 ) n C(0)W, -(Chfe^SCfeW, -(CH 2 )„W, n 6tant un nombre entier de 0 a 20 ; et 
W est $£lectionn6 dans le groupe constitu6 par CI, Br, I, -NCS, - NCO, 




et Di -Ds sont les m§mes ou sont diffe>ents et sont s6lectionn6s parmi H,F,CI, Br.l, NO2 et CN. 
10. Le compose* de la revendication 9, dans lequel C* est un nucleoside represents par la formule : 




K G 



dans laquelle B est une base nucleoside qui peut §tre protegee par un groupe de protection de base 
qui.peut §tre €limin£ ; 
G est H. OH. OR, 



31 



EP 0 424 819 B1 




R 



K est H, OH. 



I I I 

o 9 o 



I 



I 

0=P-CR 0=P-OR 



BO" nT^s I i 

OH OR 



I i | 

? . ? ? 



I©- P ^N" R 0=P-H 0=P-CE 

R 00 OH 

et R est un groupe alkyle comportant 1 a 20 atomes de carbone qui peuvent eventuellement contenir 
un heteroatome, ou un aryle substitud ou non substitue. 

11. Le proc6d6 pour modifier de maniere reversible un produit naturel, un biopolymere ou un synthon pour 
un produit naturel ou un biopolymere, comportant les phases consistant a : 

a) coupler un produit naturel, un biopolymere ou un synthon a un groupe de protection represents 
par (a formule : 

L-P 

dans laquelle P est un groupe de protection pour la protection d'un groupe fonctionnel du produit 
naturel, du biopolymere ou du synhton pour un produit naturel ou un biopolymere qui peut §tre 
supprime* du groupe fonctionnel protege* dans des conditions telles que le groupe fonctionnel est 
r6g6n6r6 ; L est une fonctionnalite* pour lier une moiti6 de modification a P et 

b) coupler une moitie* de modification a L. 

12. Un procede pour modifier de manure reversible un polynucleotide, comportant les phases consistant a 

♦ 

a) coupler un phosphoramidrte beta-cyan oethyle nucleoside protege modifie au groupe hydroxyle 
terminal 5' du polynucleotide, le phosphoramidite de nucleoside etant represente par la formule : 



L — p— O 



K G 



dans laquelle P est un groupe de protection de nucleoside pour la protection du groupe hydroxyle 
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5*. capable d'etre supprime" dans des conditions telles que le groupe hydroxyle 5' est r§g6n6re* ; 
L est une foncttonnalite* pour iier un groupe de modification ; 

B est une base nucleoside qui peut §tre prot£g£e par un groupe de protection qui peut §tre ^limine" ; 
G est H, OH. OR. 

K est H.OH, 




et; 

R est un groupe alkyle comportant 1 a 20 atomes de carbone qui peuvent eVentuellement contenir 
un h6t£roatome, ou un aryle substituS ou non substitug, 
b) coupler une moitie* de modification a L. 
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Figure 2 
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